Decoding Cursive Scripts
نویسندگان
چکیده
Online cursive handwriting recognition is currently one of the most intriguing challenges in pattern recognition. This study presents a novel approach to this problem which is composed of two complementary phases. The first is dynamic encoding of the writing trajectory into a compact sequence of discrete motor control symbols. In this compact representation we largely remove the redundancy of the script, while preserving most of its intelligible components. In the second phase these control sequences are used to train adaptive probabilistic acyclic automata (PAA) for the important ingredients of the writing trajectories, e.g. letters. We present a new and efficient learning algorithm for such stochastic automata, and demonstrate its utility for spotting and segmentation of cursive scripts. Our experiments show that over 90% of the letters are correctly spotted and identified, prior to any higher level language model. Moreover, both the training and recognition algorithms are very efficient compared to other modeling methods, and the models are 'on-line' adaptable to other writers and styles.
منابع مشابه
Chapter 0 Stroke - Based Cursive Character Recognition
In this chapter, we keep focusing on on-line writer independent cursive character recognition engine. In what follows, we explain the importance of on-line handwriting recognition over off-line, the necessity of writer independent system and the importance as well as scope of cursive scripts like Devanagari. Devanagari is considered as one of the known cursive scripts [20, 29]. However, we aim ...
متن کاملThe Optical Character Recognition for Cursive Script Using HMM: A Review
Automatic Character Recognition has wide variety of applications such as automatic postal mail sorting, number plate recognition and automatic form of reader and entering text from PDA's etc. Cursive script’s Automatic Character Recognition is a complex process facing unique issues unlike other scripts. Many solutions have been proposed in the literature to solve complexities of cursive scripts...
متن کاملThe optical character recognition of Urdu-like cursive scripts
We survey the optical character recognition (OCR) literature with reference to the Urdu-like cursive scripts. In particular, the Urdu, Pushto, and Sindhi languages are discussed, with the emphasis being on the Nasta'liq and Naskh scripts. Before detaining the OCR works, the peculiarities of the Urdu-like scripts are outlined, which are followed by the presentation of the available text image da...
متن کاملNew Approaches for Cursive Languages Recognition: Machine and Hand Written Scripts and Texts
Three different approaches are considered in this paper to deal with the methods of Pattern Classification and Recognition. The main patterns considered are images representing the alphabet of cursive-scripts languages, particularly Arabic alphabet. The practical results of written scripts recognition led to the possibility of applying the main ideas and criteria to written and spoken texts and...
متن کاملTouching Syllable Segmentation using Split Profile Algorithm
The most challenging task of a character recognition system is associated with segmentation of individual components of the script with maximum efficiency. This process is relatively easy with regard to stroke based and standard scripts. Cursive scripts are more complex possessing a large number of overlapping and touching objects, where in the statistical behavior of the topological properties...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993